Reproducible data collection

Aud Halbritter

Content

PFTC Courses

Design spreadsheet

Spreadsheet content

  • Date, time, observation number
  • Location: region/site
  • Experimental design: block, plot, replicate, number of observation, treatments
  • Organism: species/population/genet
  • Unique ID for sample/observation
  • Response
  • Predictors
  • METADATA: recorder/scribe, weather, notes

Design spreadsheet - manual

Design spreadsheet - digital

Design spreadsheet - data validation

Design spreadsheet - data validation

Spreadsheets

Discuss with your neighbour: are these good spreadsheets?

Spreadsheet - bad example

Long or wide format

Tidy data

Consistency

Consistency - meaningful names

Consistency - meaningful names

Which of these are meaningful names?

  • T

  • bird_raw

  • jja1b

  • mean

  • data

  • ddd

Consistency - meaningful names

  • A name can contain letters, numbers, dot and underscore

  • First letter must be letter or dot. If the first character in a name is a dot, the object is invisible.

  • Avoid special characters (e.g. æ, å, ø, ö)

  • Avoid reserved names: function, TRUE, mean, etc.

Consistency - style

Consistency - standards

Use global data standards when available.